NSF PAR Search | NSF Public Access Repository

Virus ecology and 7‐year temporal dynamics across a permafrost thaw gradient

https://doi.org/10.1111/1462-2920.16665

Sun, Christine L; Pratama, Akbar Adjie; Gazitúa, Maria Consuelo; Cronin, Dylan; McGivern, Bridget B; Wainaina, James M; Vik, Dean R; Zayed, Ahmed A; Bolduc, Benjamin; Wrighton, Kelly C; et al (July 2024, Environmental Microbiology)

Abstract Soil microorganisms are pivotal in the global carbon cycle, but the viruses that affect them and their impact on ecosystems are less understood. In this study, we explored the diversity, dynamics, and ecology of soil viruses through 379 metagenomes collected annually from 2010 to 2017. These samples spanned the seasonally thawed active layer of a permafrost thaw gradient, which included palsa, bog, and fen habitats. We identified 5051 virus operational taxonomic units (vOTUs), doubling the known viruses for this site. These vOTUs were largely ephemeral within habitats, suggesting a turnover at the vOTU level from year to year. While the diversity varied by thaw stage and depth‐related patterns were specific to each habitat, the virus communities did not significantly change over time. The abundance ratios of virus to host at the phylum level did not show consistent trends across the thaw gradient, depth, or time. To assess potential ecosystem impacts, we predicted hosts in silico and found viruses linked to microbial lineages involved in the carbon cycle, such as methanotrophy and methanogenesis. This included the identification of viruses of Candidatus Methanoflorens, a significant global methane contributor. We also detected a variety of potential auxiliary metabolic genes, including 24 carbon‐degrading glycoside hydrolases, six of which are uniquely terrestrial. In conclusion, these long‐term observations enhance our understanding of soil viruses in the context of climate‐relevant processes and provide opportunities to explore their role in terrestrial carbon cycling.
Full Text Available
MArVD2: a machine learning enhanced tool to discriminate between archaeal and bacterial viruses in viral datasets

https://doi.org/10.1038/s43705-023-00295-9

Vik, Dean; Bolduc, Benjamin; Roux, Simon; Sun, Christine L.; Pratama, Akbar Adjie; Krupovic, Mart; Sullivan, Matthew B. (August 2023, ISME Communications)

Abstract Our knowledge of viral sequence space has exploded with advancing sequencing technologies and large-scale sampling and analytical efforts. Though archaea are important and abundant prokaryotes in many systems, our knowledge of archaeal viruses outside of extreme environments is limited. This largely stems from the lack of a robust, high-throughput, and systematic way to distinguish between bacterial and archaeal viruses in datasets of curated viruses. Here we upgrade our prior text-based tool (MArVD) via training and testing a random forest machine learning algorithm against a newly curated dataset of archaeal viruses. After optimization, MArVD2 presented a significant improvement over its predecessor in terms of scalability, usability, and flexibility, and will allow user-defined custom training datasets as archaeal virus discovery progresses. Benchmarking showed that a model trained with viral sequences from the hypersaline, marine, and hot spring environments correctly classified 85% of the archaeal viruses with a false detection rate below 2% using a random forest prediction threshold of 80% in a separate benchmarking dataset from the same habitats.
VirSorter2: a multi-classifier, expert-guided approach to detect diverse DNA and RNA viruses

https://doi.org/10.1186/s40168-020-00990-y

Guo, Jiarong; Bolduc, Ben; Zayed, Ahmed A.; Varsani, Arvind; Dominguez-Huerta, Guillermo; Delmont, Tom O.; Pratama, Akbar Adjie; Gazitúa, M. Consuelo; Vik, Dean; Sullivan, Matthew B.; et al (December 2021, Microbiome)
Abstract Background: Viruses are a significant player in many biosphere and human ecosystems, but most signals remain “hidden” in metagenomic/metatranscriptomic sequence datasets due to the lack of universal gene markers, database representatives, and insufficiently advanced identification tools. Results: Here, we introduce VirSorter2, a DNA and RNA virus identification tool that leverages genome-informed database advances across a collection of customized automatic classifiers to improve the accuracy and range of virus sequence detection. When benchmarked against genomes from both isolated and uncultivated viruses, VirSorter2 uniquely performed consistently with high accuracy (F1-score > 0.8) across viral diversity, while all other tools under-detected viruses outside of the group most represented in reference databases (i.e., those in the order Caudovirales). Among the tools evaluated, VirSorter2 was also uniquely able to minimize errors associated with atypical cellular sequences including eukaryotic genomes and plasmids. Finally, as the virosphere exploration unravels novel viral sequences, VirSorter2’s modular design makes it inherently able to expand to new types of viruses via the design of new classifiers to maintain maximal sensitivity and specificity. Conclusion: With multi-classifier and modular design, VirSorter2 demonstrates higher overall accuracy across major viral groups and will advance our knowledge of virus evolution, diversity, and virus-microbe interaction in various ecosystems. Source code of VirSorter2 is freely available (https://bitbucket.org/MAVERICLab/virsorter2), and VirSorter2 is also available both on bioconda and as an iVirus app on CyVerse (https://de.cyverse.org/de).
Full Text Available
Expanding standards in viromics: in silico evaluation of dsDNA viral genome identification, classification, and auxiliary metabolic gene curation

https://doi.org/10.7717/peerj.11447

Pratama, Akbar Adjie; Bolduc, Benjamin; Zayed, Ahmed A.; Zhong, Zhi-Ping; Guo, Jiarong; Vik, Dean R.; Gazitúa, Maria Consuelo; Wainaina, James M.; Roux, Simon; Sullivan, Matthew B. (January 2021, PeerJ)
Background: Viruses influence global patterns of microbial diversity and nutrient cycles. Though viral metagenomics (viromics), specifically targeting dsDNA viruses, has been critical for revealing viral roles across diverse ecosystems, its analyses differ in many ways from those used for microbes. To date, viromics benchmarking has covered read pre-processing, assembly, relative abundance, read mapping thresholds and diversity estimation, but other steps would benefit from benchmarking and standardization. Here we use in silico-generated datasets and an extensive literature survey to evaluate and highlight how dataset composition (i.e., viromes vs bulk metagenomes) and assembly fragmentation impact (i) viral contig identification tool, (ii) virus taxonomic classification, and (iii) identification and curation of auxiliary metabolic genes (AMGs). Results: The in silico benchmarking of five commonly used virus identification tools show that gene-content-based tools consistently performed well for long (≥3 kbp) contigs, while k-mer- and blast-based tools were uniquely able to detect viruses from short (≤3 kbp) contigs. Notably, however, the performance increase of k-mer- and blast-based tools for short contigs was obtained at the cost of increased false positives (sometimes up to ∼5% for virome and ∼75% bulk samples), particularly when eukaryotic or mobile genetic element sequences were included in the test datasets. For viral classification, variously sized genome fragments were assessed using gene-sharing network analytics to quantify drop-offs in taxonomic assignments, which revealed correct assignations ranging from ∼95% (whole genomes) down to ∼80% (3 kbp sized genome fragments). A similar trend was also observed for other viral classification tools such as VPF-class, ViPTree and VIRIDIC, suggesting that caution is warranted when classifying short genome fragments and not full genomes. Finally, we highlight how fragmented assemblies can lead to erroneous identification of AMGs and outline a best-practices workflow to curate candidate AMGs in viral genomes assembled from metagenomes. Conclusion: Together, these benchmarking experiments and annotation guidelines should aid researchers seeking to best detect, classify, and characterize the myriad viruses ‘hidden’ in diverse sequence datasets.
Full Text Available
Diversity and ecological footprint of Global Ocean RNA viruses

https://doi.org/10.1126/science.abn6358

Dominguez-Huerta, Guillermo; Zayed, Ahmed A.; Wainaina, James M.; Guo, Jiarong; Tian, Funing; Pratama, Akbar Adjie; Bolduc, Benjamin; Mohssen, Mohamed; Zablocki, Olivier; Pelletier, Eric; et al (June 2022, Science)

Community- and “species”-level analyses elucidate ecological impacts and roles of marine RNA viruses.
Full Text Available
Cryptic and abundant marine viruses at the evolutionary origins of Earth’s RNA virome

https://doi.org/10.1126/science.abm5847

Zayed, Ahmed A.; Wainaina, James M.; Dominguez-Huerta, Guillermo; Pelletier, Eric; Guo, Jiarong; Mohssen, Mohamed; Tian, Funing; Pratama, Akbar Adjie; Bolduc, Benjamin; Zablocki, Olivier; et al (April 2022, Science)

Viruses of two candidate phyla are abundant in the ocean and revise our understanding of early RNA virus evolution.
Full Text Available